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Abstract: 

A decision maker (DM) has some funds invested through two investment firms. 
She wishes to allocate additional funds according to the firms’ earnings. The DM, 
on the one hand, tries to maximize the total expected earnings, while the firms, 
on the other hand, try to maximize the overall expected funds they manage. In 
this paper we prove that, for every market, the DM has an optimal bonus policy 
such that the firms are motivated to act according to the interests of the DM. 
On the other hand, we also prove that the only policy that is optimal in every 
market, is independent of the actions and earnings of the firms. 
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1 Introduction 


Bonus plans have an important rule in motivating agents in competitive markets, and espe¬ 
cially in financial markets. In theory, these plans are intended to guarantee that financial 
managers are investing their best time and effort to produce the optimal possible earnings 
for their investors. 

Though the motivation for these bonuses is clear, their effects are relatively vague. When 
the outcomes are stochastic and the plans are based on past performance, the agentj^ can 
take unnecessary risks, from the investors’ perspectives, to increase their own expected payoff. 
Therefore, the formulation of incentives plans has to ensure that agents are acting according 
to the best interests of the people they represent. 

In the current paper we address this problem through a basic set-up. We assume that 
a decision maker (DM) wishes to invest additional funds using several firms. To do so, 
she chooses an allocation rule, a bonus plan, that is a function of the firms’ outcomes. For 
example, a DM has $100 thousand invested through two firms and she wishes to allocate an 
additional amount of $50 thousand as a function of the earnings they produce by the end 
of the year. Another example - the DM is a manager in an investment firm that needs to 
distribute additional funds to several departments based on their annual return. 

The motivation of the firms to maximize the total expected funds they manage could pose 
a problem, as it may not coincide with the DM’s wishes to maximize the expected return. 
As it may occur in real-life, the firms can take advantage of the imperfect monitoring of their 
actions to increase their own expected payoffs, at the expense of the DM. 

The results of this paper capture this problem in two aspects. First, we prove that for 
every market, i.e., for every set of possible actions of the firms, the DM can fix an optimal 
bonus plan such that the firms, in equilibrium, act according to the preferences of the DM. 
This result applies when the market is relatively stationary, in the sense that the set of 
actions does not change drastically. The proof we present in constructive and holds for a 
general number of firms and actions. A bonus plan that is linear in the sum of differences of 
the earnings of the firms proves to be optimal. 

On the other hand, we also prove that a universal, non-trivial, optimal bonus plan does 
not exist. In other words, when the market changes frequently and the set of actions can 
change considerably, one cannot devise a bonus plan which remains optimal, unless it is 
independent of the outcomes of the firms. 

As these results indicate, we focus on the existence of an optimal bonus plan, given that a 
^We sometimes refer to the firms as agents, and to the decision maker as investor. 
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bonus plan is needed, rather than discussing the necessity of the plan itself. We assume that 
agents who are indifferent between the outcomes they produce, due to the lack of incentives, 
will almost surely lead to a sub-optimal result. For example, firms without results-based 
payoffs, either positive or negative, will have no reason to invest in R&D, expand to new 
markets, gain private information or even improve their trading capabilities. 

This perspective defers our research and previous papers. In most previous works (see 
Subsection 1 1.1 1 for a detailed description), the main question is whether a DM can design an 
incentives plan such that skilled agents are rewarded while unskilled agents are screened out. 
We, however, show that even when all the agents are skilful, every results-based bonus plan 
can lead to a sub-optimal outcome. Thus, individual investing capabilities or distinct private 
information, will have no positive effect on the agents’ chosen portfolios. 

Another aspect that differs our work and previous ones is the usage of a benchmark 
portfolio. As we compare the actual outcome of every agent to the potential outcome, 
given the agent’s abilities and information, we do not require an exogenous benchmark. 
Nonetheless, we show that a bonus plan which is linear w.r.t. the sum of the differences in 
revenue of the agents is optimal, meaning that a comparison between agents is essential. 

The implications of our results are not limited to the motivating example of additional 
funds divided between competing firms. Our model also applies to cases where an investor 
wishes to reallocate her previously-invested funds, according to future performance. There¬ 
fore, the allocation rule the investor uses, could imply actual bonuses, as well as penalties, 
and any combination of the two. 


1.1 Related literature 


Numerous studies were conducted over the importance of performance-based payoffs and 
reputation in non-deterministic markets. Yet, to the best of our knowledge, the aspect we 
examine was never discussed. 

Many papers examined the well-known ’’herding” phenomenon, where agents tend to 
mimic other agents, although they can preform better with a certain probab ility, in order t o 
conserve their reputation. One can track back this concept to the words of iKevnesI (| 19361 ): 
’’Worldly wisdom teaches that it is better for reputation to fail conventionally than to succeed 


un conventionally.” (Book 


, Ch apter 12, page 158). 


Scharfstein and SteinI (| 199(1 ) follows this argument and shows, under certain circum¬ 


stances, that agents simply mimic the behaviour of others, ignoring substantive information 
they posses. Specifically, when the capabilities of an agent are assessed by the market ac- 
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cording to his performance, as well as his conventionality, the agents may mimic each other, 


lea ding to a sub-optima . 


outcome. 


Dasguota and Prati (|2006l l continues this line of work. It studies the effect of career con¬ 


cerns on the performance of fund managers. Given a specific model, Dasgupta and Prat show 
that without career concerns, only fund managers with special abilities trade. However, they 
also prove that career concerns may lead to an equilibrium where unskilled fund managers 
have to trade in order to stand out, which may involve taking unnecessary risks. This work 
is generalized to a dynamic model in lDasguota and Prati (|2008l l. However, the latter focuses 
on the effect of career concerns on market prices and trading volume. In both cases the 
investors are risk-neutral return maximizers. 

Although the ’’herding” phenomenon is not our main focus in this paper, it is consistent 
with our results. We prove that a potentially suboptimal outcome exists, because of the need 
of agents to level their performance with the ones of their competitors. 

In general, the manipul ability a bilitie s 
than a few papers, such as iLehreii (200lh: 


(1200811: and 


Sandroni et al 


(2003 

); 

Sandroni 

(2003 

); 

Shmava 


Olszewski and Sandronil (|2008l ). 


Foster and Young 


( 201 oi l 


proves that it is almost impossible to generate a bonus plan 
such that skilled agents are rewarded while unskilled ones are eliminated from the market. 


This result is b ased on t 
able. Recently, iHe et al. 


r e assu mption that the agents’ strategies and tactics are not observ- 
(j2015l l showed that this result could be altered when a liquidation 
boundary is set along with requiring the agents to deposit their own money to offset the 
pot ential losses. _ 


Holmstrom and Milgroml (|l99ll ) studies the principle-agent problem in a completely dif¬ 


ferent set-up than ours, yet it reaches a similar conclusion that sometimes the only optimal 
incentives plan is constant. It proves that agents would divert their effort to where it is easier 
to measure their performance in order to increase their payoff and derives implications with 
respect to job design. For example, teachers that receive bonuses according to the students’ 
test scores, might neglect other important aspects. 

Assuming that some actions are known to the risk-neutral investor, ICarrollI (j2015l ) ex¬ 
amines contracts where the investor is evaluating the performance of the agent through a 
worst-case criterion consistent with her own knowledge. Under these conditions, it shows 


that linear contracts are optimal. Nevert 
our proposed linear bonus plan. In ICarroll 


r eless, one should not confuse this linearity and 
([20151) the conclusion is that a fixed share of the 
return is optimal, whereas we suggest a contract where the linearity is taken with comparison 
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to other agents. 


1.2 Outline of the paper 

The paper is organized as follows. Section [2] presents a simple 2-firm bonus-plan problem 
that illustrates the drawbacks of results-based incentives in competitive non-deterministic 
markets. In section [3] we present the model along with the main assumptions. Section H] 
includes the main results divided into two parts: In Subsection 14.II we show how to formulate 
an optimal bonus plan for any specific market; In Subsection 14.21 we prove that every bonus 
plan can fail as the market evolves, unless it is independent of the outcomes. Concluding 
remarks and additional comments are given in Section [5l 

2 A motivating example - a 2-firm bonus-plan problem 

A decision maker (DM) wishes to invest some funds through an investment firm. There are 
two possible investments firms, Firm I and Firm 2. The DM has already some funds invested 
through both of the firms and she wishes to allocate the additional funds according to some 
predetermined rule that depends on the firms’ yearly earnings. 

The goal of the DM is to maximize the expected earnings of past and future investments. 
However, as she is not aware of the possible bonds in the market, she chooses to allocate 
the entire available amount to the company which presents the highest earnings by the end 
of the following year. In case both firms present the same earnings, the funds are divided 
equally between the two firms. 

The firms can invest either in Bond Xi, which gives 5% per year with probability (w.p.) 
1, or in Bond X 2 , which gives 5.1% per year w.p. 0.6 and 0% per year w.p. 0.4%. The goal 
of the firms is to maximize their expected earnings and to maximize the overall funds they 
manage, according to the utility functions given below. 

In this example we prove that, although investing in Bond X 2 is suboptimal in expectation 
relative to an investment in Bond Xi (and also riskier), the unique equilibrium is when both 
firms invest in Bond X 2 . 

Formally, let A = {Xi, A 2 } be the set of actions of the firms. We consider the actions Xi 
and X 2 to be random variables on the same probability space with the distributions, 

per year w.p. |, 
per year w.p. |. 


Xi = 1.05 per year w.p. 1, X 2 = 
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These distributions are common knowledge of the firms. Fix A e (0,1) and define Ui and U 2 
to be the utility functions of Firm 1 and Firm 2 respectively such that 


and 


Ui{Xi,X,) = XXi + {l-X) 


^{Xi>Xj} + 




U2iXi,X,) = XXj + {l-X) 




^{Xj=Xi} 


for every action Xi of Firm 1 and for every action Xj of Firm 2. That is, the utility functions 
are a weighted average of the earnings and the additional funds distributed by the DM. The 
following lemma shows that the optimal action of both firms is X 2 ■ 


Lemma 1. The unique equilibrium is when both firms choose to invest only in Bond X 2 . 

Lemma [T] exemplifies an interesting situation. The expected earnings per year of Bond 
X 2 is 3.006% while the expected earnings per year of Bond Xi is 5%. Therefore, all the 
parties involved, firms and the DM, would benefit in case both firms choose to invest in Xi. 
Nevertheless, the unique equilibrium is (X 2 ,X 2 ) and this market fails to reach an optimal 
outcome. 

Proof. Let us compute the expected utility of Firm 1 for every profile of pure actions. 
There are four profiles we need to consider and the expected utility of Firm 1 in each is: 


E[Ui{Xi,Xi)] 

E[Ui{X2,Xi)] 

E[Ui{XuX2)] 

E[Ui{X2,X2)] 


1.05A + 0.5(1 - A) = 0.5 + 0.55A; 
1.05A + 0.6(1 - A) = 0.6 + 0.45A; 
1.0306A + 0.4(1 - A) = 0.4 + 0.6306A; 
1.0306A + 0.5(1 - A) = 0.5 + 0.5306A. 


A similar computation holds for Firm 2. Inserting these expected values to a 2-player game 
the game presented in Table [TJ 




Ala 


0.5-h0.55A, 0.5-h0.55A 

0.4 -h 0.6306A , 0.6 0.45A 

Ala 

0.6 0.45A, 0.4 0.6306A 

0.5 0.5306A, 0.5 0.5306A 


Table 1: The game played by Firm 1 and Firm 2 as a function of A. 

For every A e (0,1) and for every Firm i e {1,2}, action X 2 strongly dominates action 
Xi and the result follows. ■ 
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This example illustrates the problem when there are two firms and two pure actions. The 
model, presented in Section [3l relates to a more general case in terms of firms and actions. 


3 The model 


Fix re G N and define N = {1, ... ,re} to be a set of indices. For every i e N, let Xi be a 
random variable with a countable support Si and finite expectation defined on the probability 
space n. We refer to Xi as a pure action. Denote the set of pure actions ... ,Xn by 
A = {Xi,... ,Xn} and denote the union of the supports of the random variables in A by 

A mixed aetion (or strategy) is a convex combination of random variables in A such that 
Q = Xir=i {qi,...,qn) G A(A)i Note that a mixed action q is a random variable 

and q{u}) = every a; G D. 

Fix a natural k ^ 2. A bonus plan (BP) of dimension A; is a function / : ^ such 

that for every r G 


2 Mr) = 1 


( 1 ) 


i=l 


si 

ifl 


and f(r) e [0,1]^. A fc-player bonus-plan problem consists of a DM and k players (or firms 
The DM, who is not familiar with the pure actions in A, publicly commits to a bonus pla: 

/, which defines a fc-player game Gf as follows. Every Player i chooses a strategy, denoted 
by cj/, based on the set of pure actions A. Denote cr-^ = (cj{,..., cr{). An a; G D is randomly 
chosen and each Player i receives ft (^a-l'{uj)), while the DM receives ^ 

Definition 1. A profile of strategies is a Nash equilibrium in Gj if 


fi cri : 0-' 


/ / 


^ E 


fi («> 


for every Player i and for every strategy q. 


A convenient way to consider a fc-player bonus-plan problem is by assuming that the 
players are playing a game induced by the DM, which implies that the DM is essentially a 
game designer or a mechanism designer. The players in Gj wish to maximize their expected 
payoffs in equilibrium, that depend on the bonus plan /, and the DM wants to maximize 
her expected payoff which is a function of the equilibrium strategies of the players m Gf. 

^We use A(B) to denote the set of probability measures on the set B. 

^We will not henceforth relate to the dimension of the function / when it is clear from the context. 
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Formally, Player i wishes to maximize E [/j where is a Nash equilibrium in G/, 

while the DM wishes the maximize E |^Xj^=i ■ 

The bouns-plan problem has some similarities to the principle-agent problem. The DM 
represents the principle who is interested in motivating her agents (e.g., investments firms) 
to produce optimal expected earnings. However, the example given in Section [2] shows that 
the appropriate method is not always the simple one. Sometimes a simple or an intuitive 
bonus plan could generate a sub-optimal result for all sides. A general result motivated by 
this example is given in the following section. 


Definition 2. A bonus plan f is optimal, if the induced k-player game Gf has an equilibrium 
such that 


E 

i=l 


= femaxHlA,]. 

ieN 


( 2 ) 


That is, / is optimal if there exist an equilibrium in Gj where the mixed actions of 
all the players are weighted averages of pure actions with maximal expected value. 


Remark 1. In order to simplify the computations and notations, we assume that 


H[Ai] > E[Xil 


(3) 


for every 2 ^ i ^ n. This implies that a bonus plan f is optimal if and only if [Xi ,..., Xi) 
is an equilibrium in Gf. 


Remark 2. For every bonus plan f and for every vector of pure actions X, it follows that 


E 




= 1. 


Therefore, the game Gf is a fixed-sum game in the sense that the sum of the expected payoffs 
of all the players is 1. For every bonus plan f and for every game Gf we can define an 
auxiliary bonus plan f* such that f*{r) = f{r)— (i,..., i) for every r 6 The induced 
game Gf* is a symmetric zero-sum k-player game. 


4 Main results 

In this section we prove the two main results of the paper. The hrst result shows that for 
every finite set of random variables A, there exists an optimal bonus plan /. The second 
result states that for every non-trivial (i.e., non-constant) bonus plan /, there exists a set A 
such that / is not optimal. 
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The combination of these results is significant when considering contracts planning. On 
the one hand, in any non-deterministic market the DM can design a contract that ensures her 
interests are kept by the agents. On the other hand, if the market is dynamic, meaning that 
the market is constantly changing in terms of possible actions, then any bonus plan which is 
not trivial can eventually lead to a suboptimal result. In other words, the only bonus plan 
that assures that the players are acting according to the DM’s preferences, is a bonus plan 
which is, paradoxically, independent of the players’ actions. 


4.1 Fixed set of actions 


The first theorem we prove is constructive. For every set of actions A, we specify an optimal 
bonus plan /. The optimal bonus plan /, that we propose, is linear in the differences of the 
earning of the players. 

We start with the simple case of a bounded set Sa- Fix a set of pure actions A. Let 
Ia be the minimal, finite, and closed interval containing all the values of Sa and let Ma 
be the absolute value of the maximal element in Ia (w.r.t the absolute value). That is, for 
every uj e H and every action W, it follows that Xi{u!) e Ia and |W(a;)| ^ Ma- Define the 
AlA-Linear Bonus Plan f by 


fi{r) 


1 T.j=iiri-rj) 
k 2k{k-l)MA ’ 
1 

fc’ 


if r 6 l\, 
if r ^ iX- 


One can verify that / is well-defined, since for every r e M^, the equality fi{r) = 1 holds 
and /(r) e [0,1]^. 


Theorem 1. For every set of pure actions A with finite support Sa, the MA-Linear Bonus 
Plan is optimal. 


Proof. We have to prove that {Xi,... ,Xi) is an equilibrium in G/, or equivalently, 
that for every Player i and for every strategy q = of Player i (when q Xi), 

the inequality E [fi (Xi,..., Xi)] < E [fi {q, Xi,..., Xi)] holds. Without loss of generality. 
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assume that i = 1, therefore 


E[h{q,X^,...,X,)] = 


< 

when the second and last equalities follow from the definition of / and the inequality follows 
from the fact that q ¥= Xi and E[q] = E EILi ^i^i\ < ■ 

The fact that {Xi,, Xi) is an equilibrium is not snrprising. It follows directly from 
the linearity of fi{r) in rj. This linearity implies that Xi is a dominant strategy for every 
player and therefore, the equilibrium is unique. 

The following theorem presents a Bounded and Linear Bonus Plan / for the general case 
(i.e., for the case that Sa is not bounded) and proves that it is optimal. 

Theorem 2. For every set of pure actions A, there exists an optimal bonus plan. 

Proof. We extend Theorem [1] for an unbounded set Sa by constructing a new optimal 
bonus plan. Let hq^i = {q, {Xi)j^i) be a vector of actions where player i chooses the mixed 
action q and every other player plays Xi. Note that for every q ^ Xi, the expected value 
E[q- Xi] = -c, < 0 for some Cg > 0. Since the expected value is negative and finite, there 
exists a real number M{q) >0 such that for every m ^ M{q) 


E [(g - ^l)l{|g-Xi|s;m}] < -y, 

(4) 

2 \l\Pr{q-X,=l)<^. 

(5) 


\l\>m 


Ineq. (jH) and Q are strict, therefore there exists an open set B{q) containing q snch 
that both inequalities hold for every g' 6 B{q) where q' A Xi. The set of mixed actions q is 
compact as it can be represented by a (n — l)-simplex, hence the set of open sets {B{q)} is an 
open cover. It is known that there exists a finite subcover Ba- Fix c = min^gs^ Cg > 0 and 
let M ^ maxggs^ M{q) be a positive number such that for every M and every q ^ X\, 

E[{q - Xi)l^\q_Xi\^m}] < ( 6 ) 


E 


hiY,qiXi,X,,...,Xi 


Ki=l 


2kMA 


{k-l)^^^^q,X,~{k-l)Xi 1 

2k{k - 1 )Ma ^ k 


2 


2=1 


- = E[fiiXu...,X,)], 
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and 


(7) 


2 |/|Pr(g-Xi = 0< 

|Z|>m 

Note that (j7]) hold for q = Xi with weak inequality. 
Dehne the Bounded-Linear Bonus Plan f by 




J fc 2k{k-l)M ’ ^ fc ^ 2k{k-l)M 

if otherwise. 


k’ 


One can verify that / is a well-defined bonus plan. 

We prove that (Wi,..., X\) is an equilibrium in the game induced by the Bounded-Linear 
Bonus Plan by showing w.l.o.g. that 


for every q ^ Xi. Note that 


0 ^ - + 


1 . 


< 


k 2k{k — 1)M k 


-2M ^q- Xi^ 2M 
\q- Xi\ ^ 2M. 


Thus, 




1 . lljAd-Xi) 


k 2k{k-l)M ) 


+ -Pr(|g-Xi| > 2M) 
k 

\1\>2M 


It follows from ([6|) and d?]) that 






k AMk 2Mk 


1 

< ^— 


\1\>2M 
C 1 


k AMk AMk fc’ 


as previously stated. 
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4.2 A universal bonus plan 

A universal bonus plan / is a non-constant bonus plan such that for every finite set of actions, 
/ is optimal. Specifically, A universal bonus plan / is a non-constant bonus plan such that 
for every set of actions Xi ,..., where i?[Ai] > for every 2 ^ j ^ n, the profile of 

actions (Ai, ..., Xi) is an equilibrium in Gf. The following theorem proves such a plan does 
not exist. 


Theorem 3 . If there are two players, a universal bonus plan does not exist. 


Proof. Let x < y < z. We first prove that fi{y,y) ^ fi{x,y). Assume to the contrary 
that fi{y,y) < fi{x,y). Let A = {Ai, A 2 } be a set of two actions Xi and X2 with a joint 
probability distribution, 


Ai\A 2 

X 

y 

X 

0 

0 

y 

1 

0 


Note that L'[Ai] > E[X 2 ]. However, 


E[h{X2,X^)] = h{x,y) > fi{y,y) = E[h{X^,Xi)l 


implying that {Xi,Xi) is not an equilibrium in Gf, since Player 1 can benefit from deviating 
to A 2 . Thus, 

fi{y,y) > fi{x,y)- ( 8 ) 


For similar reasons. 


f2{y,y) > f2{y,x). 


( 9 ) 


Next we prove that fi{x,x) ^ fi{y,x). Let p be a number in (0,1) and let A = {Ai, A2} 
be a set of two actions Ai and A 2 with a joint probability distribution. 


Ai\A 2 

X 

y 

X 

0 

p 

Z 

1 — p 

0 


A direct computation shows that 


F[/i(Ai,Ai)]-F[/i(A2,Ai)] = p{h{x,x)-h{y,x)) 

+ - fi{x,z)). 
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Recall that fi{z, z) — fi{x, z) is bounded. If fi{x, x) < fi{y, x), then there is p smaller than, 
but sufficiently close to 1, such that for every z, E[fi{Xi, Xi)] — E[fi{X2, Xi)] < 0. In other 
words, 

E[MXi,X,)]<E[MX2,Xi)]. ( 10 ) 

Now one can choose 2 to be sufficiently large, so that > E[X2]. Inequality (fTUl) implies 

that (Xi,Xi) is not an equilibrium in Gf, since Player 1 can benefit from deviating to X 2 . 
Hence, 

fi{x,x) ^ fi{y,x). ( 11 ) 


Similar reason imply that 

f 2 {x,x) ^ f 2 {x,y). (12) 

We now sum up inequalities (HI), dH), (fTTI) and (fT^ to obtain, fi{y, y) + f2{y, y) + fi{x, x) + 
f2{x,x) ^ fi{x,y) + f2{y,x) + fi{y,x) + f2{x,y). Due to Eq. ([I]) we obtain equality. Thus, 
(D, m, (ffH) and (fT^ are actually equalities. Therefore, 


fi{x,x) = fiix,y) = fi{y,x) = fi{y,y), 


and the proof is complete. ■ 

A generalization of Theorem [3] to any number of players /c ^ 3 is not trivial. In fact, the 
requirement that / not be constant is not sufficient. For example, take any non-constant 
bonus plan / : ^ such that fi{r) = ^ for every r e with at least two identical 

coordinates. In this case, for every action Xj, the profile of strategies {Xj,... ,Xj) is an 
equilibrium, as any deviation of a single player has no influence on the payoffs. On the other 
hand, requiring that all equilibria satisfy Eq. ([2]) does not hold when / is constant, as all 
profiles are equilibria. Therefore, we add a new requirement when the number of players is 
three or more. 

Definition 3. A strongly universal bonus plan f, is a bonus plan where every profile of 
actions in Gj that sustains Eq. ([2]) is an equilibrium. 

Theorem 4. If f is a strongly universal bonus plan, then every profile of actions is an 
equilibrium. 

Proof. Let / be a strongly universal bonus plan. Clearly, Theorem [3] implies that the 
result holds for the case ofA: = 2. Fixfc^3. We prove the theorem by showing that for 
every player i and for every vector of outcomes r e the coordinate fi{r) of the bonus 
plan is non-decreasing and non-increasing in r./. 
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Assume to the contrary that there exists a player i, a vector of outcomes r e and 
Wi eM such that fi{wi,r-i) > fi{ri,r-i) where Wi < Vi. Define the random variable X such 
that Pr(A = x) > 0 iff x = when 1 ^ j ^ k. Assume that Pr(A = Vi) > Pr(A = rj) for 
every j ^ i. In addition, dehne a set of i.i.d. random variables Xj ~ X where 1 ^ j ^ k. 
Dehne the vector-valued random variable (VP, A_j) by 

Pr((VP, A_i) = x) = Pr((A„ A_i) = x) Vx A r. 


and 

Pr((VP, A_i) = {wi,r.i)) = Y>i{{Xi,X_i) = r). 

Clearly, (VP, A_j) and VP are well-dehned. A direct computation shows that £'[VP] < E[Xl 
However, the vector (Aj, A_j) is not an equilibrium as player i can deviate to VP and increase 
his payoff since fi{wi,r-i) > fi{ri,r-i). Hence is non-decreasing for every i and 

every r_j. 

Now assume to the contrary that fi{ri,r-i) is strictly increasing in Xj. That is, there 
exists a player i, a vector of outcomes r e and £ M such that fi{yi,r-i) > 
where yi> ri. 

Let z, z 6 M be two real number such that z > rj > z for every 1 ^ j ^ k and let p be 
a number in (0,1). Define the random variable Y such that, with probability p, it follows 
that Pr(y = y) > 0 iS y = rj when 1 ^ j ^ k (assume that Pr(y = Xj) > Pr(y = rj) for 
every j A i), and with probability 1 —p, the random variable Y equals z. In addition, define 
a set of i.i.d. random variables Yj Y where 1 ^ j ^ k. Dehne the vector-valued random 
variable (Z,Y-i) by 

Pr((Z,Y_i) = y) = Pr((Yi,Y_i) = y) Vy A r,y, A ^ Vj, 

Pr((Z,y_0 = (y„x_,)) = Pr((yi,y_o = x), 
and if there exists a coordinate j in y e such that yj = z, then 

Pr((y,y_o = (z,y_i)) = Pr((yi,y_,) = y). 
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Clearly, (Z, ICj) and Z are well-defined. Note that 




> 


E 


[fiin, 


Y_i)l 


Wj} 


+ Myi,'r-i)Fr{(Yi,Y_i) = r) 


j/sR''; 

3i,%=S 


E 


fi{Yi,Y_i)l^Yj^z Vj} 


Yi = y) 


E [fiiYi, Y_i)] + 2 ifiU, y-i) - fi{y))PiiiYi, Y_i) = y), 
3i,%=S 


(13) 

(14) 


when Ineq. (fT3]l follows from the assumption that fi{yi,r-i) > fi{ri,r-i). The sum in Eq. 
()14p is bounded, therefore we can choose p sufficiently close to 1 (but still smaller than 1), such 
that for every z, the inequality E[fi{Z,Y-i)] > E [fi{Yi,Y-i)] holds. Taking a sufficiently 
large z and a sufficiently low z, guarantees that E[Y] > E[Z]. 

In conclusion, the vector {Yi,Y-i) is not an equilibrium as player i can deviate to Z and 
increase his payoff. Contradiction. Hence is non-increasing for every i and every 

r_j. The combination of the two results, proves that fi is independent of the coordinate. 
This implies that the expected payoff of every player i is independent of his actions and every 
profile of actions is an equilibrium. ■ 


5 Conclusions and addtional comments 


The results presented in this paper have two completing aspects: an applicative one and 
a theoretical one. On the one hand, we give a specific description of a bonus plan that 
guarantees that agents are motivated to act according to the DM’s wishes. On the other 
hand, we show that an optimal bonus plan simply does not exist, unless it is useless as a 
bonus plan (constant, meaning fixed payoffs). 

This work is based on the assumption that the DM tries to maximize her expected 
p ayoff. Although, this assumption is common in the literature, one can still follow the line 
of lHolmstrom and Milgroml (jl99ll ) and assume that the DM tries to optimize a few elements 


simultaneously. For example, maximizing the expected revenue while minimizing the risk. In 
general, one can assume that the DM has a certain preference relation over the set of actions 
and try to find a bonus plan which implements this preference in equilibria. These problems 
are left for future research. 

Another element left for future work is the generalization given in Theorem 01 This 
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generalization could be done in various ways. E.g., we can assume that always has 

some dependency on the action of player i for every r_j, and then explore whether an optimal 
bonus plan is possible under such conditions. 
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